Natural entropy fluctuations discriminate similar looking electric signals emitted from 

systems of different dynamics* 
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Complexity measures are introduced, that quantify the change of the natural entropy fluctuations 
at different length scales in time-series emitted from systems operating far from equilibrium. They 
identify impending sudden cardiac death (SD) by analyzing fifteen minutes electrocardiograms, and 
comparing to those of truly healthy humans (H). These measures seem to be complementary to the 
ones suggested recently [Phys. Rev. E 70, 011106 (2004)] and altogether enable the classification 
of individuals into three categories: H, heart disease patients and SD. All the SD individuals, who 
exhibit critical dynamics, result in a common behavior. 
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I. INTRODUCTION 



The problem of distinguishing electric signals which, 
although look to be similar, they are emitted from sys- 
tems of different dynamics, still attracts a strong inter- 
est. Two characteristic cases of major practical impor- 
tance are: First, Seismic Electric Signals (SES) activi- 
ties, which are low frequency (<1 Hz) signals of dichoto- 
mous nature that have been found in Greece [1-3] and 
Japan[4] to precede earthquakes, may look to be simi- 
lar to "artificial" noises (AN), which are electrical dis- 
turbances emitted from nearby man-made sources. It 
has been argued[l, 3, 5] that SES activities are emitted 
when the stress reaches a critical value in the EQ focal 
area. Second, sudden cardiac death (SD), which is the 
primary cause of mortality in the industrialized world [6], 
may occur even if the electrocardiogram (EGG) looks to 
be similar to that of truly healthy (H) humans. Sudden 
cardiac arrest may also be considered as a dynamic phase 
transition (critical phenomenon) [7, 8]. 

Both cases have been treated in Ref.[8], but here we 
only focus on the second one. The time-series will be an- 
alyzed in the natural time-domain. The natural time 
X is introduced [5, 9] by ascribing to the m-th pulse 
of an electric signal consisting of N pulses, the value 
Xm — Tii/N and the analysis is made in terms of the 
couple {Xm,Qm), where Qm denotes the duration of 
the m-th pulse. The entropy S in the natural time- 
domain[9, 10] is defined as S* = (xlnx) — (x) ln(x}: where 
(X Inx) = EfcLiPfcXfelnXfc , (x) = EfcLiPfcXfc and pfc 
Qk/'l2n=i dynamic entropy depending on the 

sequential order of pulses [8]. Here we calculate the value 
of S for a number of consecutive pulses and study how 
it varies within the recording (i.e., using a time- window 
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of certain length sliding, each time by one pulse, 
through the whole time-scries). Thus, for a window of 
length N.jj], when starting from the mo-th pulse, we have 
S{mQ,N^) = (xlnx>«; - {x)wln{x)w, where {x^nx)w = 
J2k=iPk:wXk,wlnxk,w , {x)w = Y.k=iPk,wXk,w with 

Pk,w — Qma~l + k/ ^nZlQma-l+n and Xk,w — k/Nw 

This variation is quantified by the standard deviation 5S 
{= 5SnJ of {^(mo,7V^),mo = 1, 2, . . . TV - 7V^}. The 
value of 5S may change to a different value SSshuf when 
repeating the same calculation but after shuffling the Qm 
randomly. In Ref. [8] we showed that a distinction be- 
tween SD and H can be achieved when calculating both 
SSshuf and SS at the same (time- window) length iV^, and 
then studying their ratio i5S's;i„//(5S' (which is labeled by 
J/) . Here we show that a similar distinction may be alter- 
natively achieved if we introduce appropriate measures 
that quantify the (55- variability upon changing the time- 
window length and, interestingly, their values approach 
the value of the Markovian case in SD, who exhibit crit- 
ical dynamics. Furthermore, we show that the measures 
suggested in this paper exhibit a certain type of comple- 
mentarity when compared to those discussed in [8] . 

In EGG, the turning points are traditionally labeled 
with the letters Q, R, S, T, see Fig.l(a). (cf. In Fig. 
1(b) we show, for example, how the QT interval time- 
series can be read in natural time). The RR- (beat-to- 
beat) and QRS-intervals (cf. mainly the RR) can be 
automatically detected[ll-14] (which was followed here) 
more easily than the QT-. In spite of this fact, we inten- 
tionally study here all these three types of intervals for 
the following reasons: It has been clinically observed that 
the QT interval usually exhibits prolonged values before 
cardiac death (see Ref. [15] and references therein). Inter- 
estingly, this clinical observation was found [8] to be con- 
sistent with the fact that in all SD the SS- (and SSshuf-) 
values themselves of the QT-intervals exceed those of H, 
see Fig. 2 (cf. the latter distinction between SD and H 
cannot be attributed to the allocation error of the QT 
interval, see Section VIII of Ref. [16]). Since the latter 
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FIG. 1: (Color) (a) Schematic diagram (not in scale) of a 
three heartbeat excerpt of an ECG in the usual (conventional) 
time-domain. Only the durations Qm,Qm+i,Qm+2 of the QT- 
interval (marked in each single cycle of the ECG correspond- 
ing to one heartbeat) are shown, (b) The QT-interval time- 
series of (a) read in natural time; the vertical bars are equally 
spaced, but the length of each bar denotes the duration of the 
corresponding QT-interval marked in (a). 



systematic behavior is not found when studying the RR- 
or the QRS- intervals [8], it is interesting to investigate 
here whether a systematicity occurs when employing the 
complexity measures suggested in this paper. Actually, 
we find that the latter measures seem to enable the dis- 
tinction between SD and H when using the RR- and QRS- 
intcrvals of the original time-series. Furthermore, and 
most interestingly, we pinpoint that, even when solely us- 
ing the most easily accessible values of the RR-intervals, 
such a distinction seems to be possible if we apply these 
measures to both the original time-series and the one 
obtained after shuffling the Qm randomly. We use here 
the QT Database from physiobank[17], which includes 
fifteen minutes recordings of 10 H and 24 SD (as well 
as recordings from four groups of heart disease patients, 
see below). Examples of the (5 S'- values, calculated for the 
RR-, QRS- and QT- intervals in the range 3 to 100 beats 
are plotted in Figs. 3(a) and (b) for one H and one SD, 
respectively. As for the symbols, we use the same con- 
vention as in Ref.[8], i.e., 5S is used only when the calcu- 
lation is made by a single time- window (e.g., 5 pulses), 
while the symbol 5S stands for the average of the 5S- 
values calculated for a sequence of single windows (e.g., 
3, 4 pulses). Finally, {5S) denotes the 55'- values averaged 
over a group of individuals, e.g., 10 healthy subjects. 

Before proceeding, however, it might be useful to reca- 
pitulate the main differences of our procedure compared 
to several other earlier attempts by other groups. The 
reasons why the concept of entropy should be preferred 
(compared to other quantities) as discriminating statis- 



tics in physiological time-series have been explained in 
detail in Ref.[8]. Furthermore, the advantages of using 
complexity measures based on dynamic entropy (and not 
on static entropy, e.g., Shannon entropy), as for example 
the Kolmogorov-Sinai entropy (K-S entropy), have been 
clarified[8]. Earlier attempts in the ECG analysis have 
actually used measures related to dynamic entropy. For 
example, the so called approximate entropy (AE)[18] or 
sample entropy (SE)[19] have been introduced and later 
used by other authors (e.g., see Ref. [20] where AE is 
applied beyond other measures; see also Ref. [16]). Also, 
Costa et al.[21] introduced the multiscale entropy (MSE) 
approach, the algorithm of which is based on AE or SE, 
calculating the entropy at different scales. As for the 
5, which is also a dynamic entropy, as already men- 
tioned, differs essentially from the other ones, because 
it is defined [9, 10] in an entirely different time-domain 
(see Fig. 1(b)). Moreover, the following has been found: 
When studying the 5- values themselves, most SES activ- 
ities can be clearly distinguished[10] from the majority of 
AN, because they have S-values smaller and larger, re- 
spectively, than the value Su = 0.0966 of the "uniform" 
distribution (as the latter was defined in Refs.[10, 22]); on 
the other hand, when dealing with ECG they all have S- 
values comparable, more or less, to 5„ [8], see also [16], 
thus not allowing a clear distinction among their prin- 
cipal categories (cf. the entropy values themselves have 
been used in earlier attempts). This is achieved, however, 
when we quantify the S fluctuations[8\ and use ratios of 
"shuffled" and "unshuffled" S fluctuations on fixed time 
scales [8] or ratios on different time scales that will be 
introduced here in Section II. Thus, in order to discrimi- 
nate similar looking electric signals emitted from systems 
of different dynamics, the following seems to hold: sig- 
nals that have S'-values more or less comparable to Su 
(which is the case of all ECG) can be better classified by 
the complexity measures relevant to the fluctuations SS 
of the entropy; if the S'-values markedly differ from Su 
(which is usually -but not always- the case of SES and 
AN), the classification of these signals should be prefer- 
ably made by the use of the S'-values themselves. 



II. THE NEW COMPLEXITY MEASURES 
PROPOSED. THE DISTINCTION BETWEEN SD 
AND H 

In classical Thermodynamics, the systems are stud- 
ied close to equilibrium and the relevant quantities have 
a clear physical meaning. In non-equilibrium systems, 
however, the meaning of entropy and its treatment should 
be handled with great caution (e.g., [1]), because there 
is at present (e.g., see Ref. [23]) no unified statisti- 
cal mechanical theory underlying these systems, (cf. 
In transformations between non-equilibrium stationary 
states, entropy might be a not well defined concept [24]; 
the connection of the entropy to microscopic dynamics is 
still a matter of intensive research (e.g., [25] and references 



3 



therein)). In complex systems operating far from equi- 
librium (like the case of heart dynamics[26]), long-range 
correlations play an important role (cf. such correlations 
are, of course, of prominent importance in equilibrium 
systems as well, when approaching a critical point, e.g., 
the "critical" temperature Tc, i.e., T — > Tc). Thus, in 
the latter systems both correlations (i.e., short- and long- 
range) , in general, is advisable to be studied carefully and 
hence appropriate complexity measures should be envis- 
aged. This is, in simple words, the physics underlying the 
present paper and stimulated the procedure followed. 

Along these lines, we introduce the ratios 
5Si{RR)/SSj{RR), SS,{QRS)/SSjiQRS) and 
6Si{QT)/6S/{QT) for the RR-, QRS- and QT-intervals, 
respectively, where i,j denote the time- window length 
used in the calculation of SS. Assuming that j < i, 
these three ratios provide measures of the SS- variability 
when a scale i changes to a scale j. We select as a 
common scale (for all RR-, QRS- and QT-) the smallest 
j-value allowed for the natural time-domain analysis, 
i.e., J = 3 beats, and for the short range (s) i = 5, 
while for the longer (L) i = 60 beats. Thus, the 
following ratios are studied: Xs{t) = (55*5 (t)/(5S'3 (r) and 
Al(t) = SSqo{t)/SS3{t), where r denotes the type of 
interval, i.e., r =RR, QRS or QT. We also define the 
ratios Pi{T) = SSi{RR)/SSi{T), which provide a relative 
measure of the 55'- values of the RR-intervals compared 
to either QRS- or QT- (for the same number of beats i). 
Here, we will use for the short range Ps{t) = P3{t) and 
for the long range /3l(t) = peoir). 

The calculated values for the complexity measures 
AkjPk (where k denotes either the short, k = s, or the 
longer, k = L, range) are given, for all H and SD, in Table 
I. The minima minij[AK(T)] and maxima max//[AK(T)] 
among the healthy individuals for the RR (r =RR) and 
QRS (r =QRS) intervals are also inserted in this Table. 
We also include the corresponding minima minjy[/3K (■'')] 
and maxima maxHipKiT)] for (the relative (55- variability 
measure) p. For the sake of simplicity, they are labelled 
Hmin and Hmax, respectively (and jointly named H- 
limits). The superscripts 'a' and 'b' show the cases of 
SD which have smaller and larger values than Hmm and 
Hmax, respectively. In two individuals, i.e., sel41 and 
sel51, it is uncertain whether their measure Xs{QRS) vi- 
olates the value iJmi„=1.15. 

Table I reveals that all SD violate one or more 
if-limits of XsiRR), Xl{RR), PsiQRS) and pl{QRS), 
and hence can be distinguished from H. In other words, 
the (55-variability measures of the RR-intervals, together 
with their relative ones with respect to the QRS- (i.e., 
four parameters in total), seem to achieve a distinction 
between SD and H. Note that Xi^{RR) alone can clas- 
sify the vast majority of SD. Furthermore, attention is 
drawn to the point that if we also consider the Ak(t) 
values calculated ( not in the original, but) in the ran- 
domized ("shuffled") sequence of Q™, we find that all SD 
violate one or more H-limits of Xk{RR) and X^^shufiRR) 
(see Table VII of Ref.[16]). This allows (cf. using again 
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FIG. 2: (color) (a) The 5S{QT)-va.\ue for each of the 24 SD 
and 10 H (see Table I) and (b) the average of the ^^(QT) 
values -designated by {SS{QT)}- along with their standard 
error deviation for each of the two groups SD and H versus 
the time-window length. 



four parameters in total) the distinction of SD from H by 
using the RR-intervals only. 

Thus, we found that among the 10 parameters defined 
in the original time-series extracted from each ECG (or 
20 parameters, in total, if we also account for the corre- 
sponding parameters defined in the series obtained after 
shuffling the Qm randomly), only four are required for 
the distinction between SD and H. We clarify that this 
seems to be extremely difficult to be achieved by chance. 
In order to visualize it, if we assume (for the sake of con- 
venience only) independent and identically distributed 
(iid) distributions of the parameters for one subject, we 
find that the probability that all 4 parameters are within 
the bounds (minima and maxima) set by 10 other sub- 
jects (i.e., the healthy ones) is (1-2/11)-* « 0.448. Thus, 
the probability that all 24 additional subjects are classi- 
fied as SD by pure chance is (1 - 0.448)^4 « 6.4 x 10"^ 
i.e., extremely small. Concerning the validity of this sta- 
tistical argument, we clarify that it does not remain valid 
if one just picks 4 parameters out of the original 20 ones. 
Only if one decides which parameters wants to use before 
the calculation of the values, the argument is valid(This 
is the reason why blind evaluation- defining all methods, 
parameters and criteria studying one set of data, and 
then testing the significance using an additonal set of in- 
dependent data- is considered very important in medical 
applications and/or publications). 

We now attempt a physical interpretation of the 
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FIG. 4: (Color) The average (denoted by the brackets) val- 
ues of (a): the &S(RR) and (b): &S{RR) j &Sz{RR) for the 
SD (solid black) and H (red circles) versus the time-window 
length; the bars correspond to the standard error of the mean. 
The results for a Markovian time-series are also plotted (green 
squares), but the bars here denote the standard deviation. 



FIG. 3: (Color) The ^S-value versus the time-window length 
for one H (a) and one SD (b). Intervals: QT (solid red), QRS 
(broken green), and RR (dotted blue). 



present results, the main feature of which focuses on 
the fact that both ratios Xs{RR) and Xl{RR) become 
smaller, in the vast majority of SD, compared to H. 
Recall that the (55'(i?i?)-values themselves cannot dis- 
tinguish SD from H, see Fig. 4(a), in contrast to the 
ratios SS^{RR)/6S3{RR), see Fig. 4(b). Before pro- 
ceeding, we mention two points: First, for individuals 
at high risk of sudden death, fractal organization (long 
range correlations) breaks down (see Refs.[26, 27] and 
references therein). The breakdown of fractal physio- 
logic complexity is often accompanied by emergence of 
uncorrelated randomness or excessive order (e.g., peri- 
odic oscillations appear in the heart rate recordings of 
"frequency" «l/min, which are associated with Cheyne- 
Stokes breathing) [26]. Second, if we calculate[8, 10] the 
55'- values in a (dichotomous) Markovian {A4) time-series 
(exponentially distributed pulses), for a total number of 
N=10^ pulses (i.e., length comparable to that of the ECG 
analyzed here), we find that these values: (a) lead to 
A^(A^) = 1.20 ± 0.03 and (b) differ drastically, see Fig. 
4(a), from the (5S'(i?i?)-values themselves of both SD and 
H (thus indicating that they exhibit non-Markovian be- 
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FIG. 5: (color) The average of the SS{QT) values -labeUed 
{SS{QT))- for each of the six groups labelled H, MIT, MSV, 
MST, EST and SD (see the text) versus the time-window 
length. The bars denote the standard error of the mean. (The 
corresponding standard deviations overlap considerably and 
hence are not shown for the sake of clarity). The lowermost- 
and the uppermost- curve correspond to H and SD, respec- 
tively and hence coincide with the two curves depicted in Fig. 
2(b). 
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havior on the whole; this is consistent with the aspects 
that bodily rhythms, such as the heart beat, show com- 
plex dynamics, e.g., [26, 27]). The fact that Xs{RR) in 
SD becomes smaller than in H can now be understood 
as follows: Since H exhibit a high order of complexity, it 
is expected that (even) their ffmi„-value (=1.43), should 
markedly exceed Xs(M-). On the other hand, in SD this 
high complexity is lost, and hence their Xs{RR) values 
naturally approach Xs{A4), thus becoming smaller. In- 
terestingly, the SD average value of (RR) in Table I is 
1.19, i.e., coincides with Xs{M). (Such a coincidence also 
occurs for the QRS-intervals in both H and SD, which 
agrees with the observations[15] mentioned above that 
the prolonged QT-intervals in SD mainly originate from 
enlarged ST-values, while their QRS- may remain the 
same.) We now proceed to the interpretation of our re- 
sults related to the ratio Xl{RR)- In H, it is expected 
that (in view of the RR long range correlations [26]) the 
corresponding values must be appreciably larger than 
Xl{M) = 0.64 ± 0.05, calculated in the Markovian case 
(Fig. 4(b)). We now examine the SD: If, in SD "un- 
correlated randomness" appears, this reflects that their 
Xl{RR) values naturally approach Xl{A4), thus becom- 
ing smaller (compared to H); this actually occurs in the 
vast majority of SD in Table I. // in SD the aforemen- 
tioned periodicities appear, it is naturally expected to 
find large (see Ref.[16]) 55- values when a time- window of 
length around 60 beats, or so (i.e., related to the afore- 
mentioned "frequency" «l/min) sweeps through the RR 
time-series, thus resulting in 55- values even larger than 
those in H (since in H no such periodicities appear). The 
latter actually occurs in the few cases marked with su- 
perscript 'b' (i.e., those exceeding Hmax) in Table I (For 
additional arguments on the interpretation see [16]). 

The fact that the overall behavior of the complexity 
measures introduced in this paper (i.e., clear distinc- 
tion of SD from H) is more or less similar to that of 
the measures discussed in Ref.[8] does not mean that 
the former measures are similar to the latter, because, 
as we shall explain below they exhibit a certain type of 
complementarity in the following sense: if in the frame 
of the one procedure an ambiguity emerges in the dis- 
tinction between SD and H, the other procedure gives 
a clear answer. (Recall that, as mentioned in Section I, 
in Ref.[8] we discussed entropy fluctuations -and ratios 
of "shuffled" and "unshuflled" entropy fluctuations- on 
fixed time scales, while here we study entropy fluctua- 
tions on different time scales.) This is consistent with 
the findings of Ashkenazy et al.[28] that an approach 
dealing with ratios on the same time scale and an ap- 
proach dealing with ratios on different time scales (or 
corresponding scaling exponents) are somewhat comple- 
mentary. We now study, as an example, the following 
two procedures: i.e., the one that uses (55(QT)[8] and 
the other which combines the measures X,p. The -values 
of SD and H given in the last column of Table I are classi- 
fied into two classes: the larger values correspond to SD, 
and the lower ones correspond to H (see also Figs. 2 and 



5 ) . Let us focus on the two lowermost SD values and the 
uppermost H value. The former two correspond to scl33 
and sel34 553-4 (QT)=0.00076 and_0.00069, respectively) 
and the latter one_ to sell6795 (553-4(QT)=0.00056). 
In view of their 553_4(QT)-values proximity, one may 
wonder whether these two SD could be confused with 
H. This ambiguity can be dissolved in the light of the 
other procedure (i.e.. A, p), as follows: Table I reveals 
that sel33 markedly violates both the _ffmj„-limit for 
As (QRS) as well as ifmm for As(RR) (the latter can 
be visualized in Fig. 6). As for sel34, the i?„a2,-limit 
of Al(QRS) is strongly violated. We now turn to an 
alternative example, i.e., sel47, who, by means of the 
method using the complexity measures A, p (of the RR- 
and QRS-intervals), could be confused with H, because 
a deviation of only around 12% from the ifmin-finiit of 
min//[ps(Q_R5)] = 0.18 is noticed. This ambiguity can 
be dissolved by means of the procedure using 55(QT) 
as follows: sel47 has 553_4(QT)=0.0029 , which exceeds 
significantly, i.e., by a factor 5, the corresponding value of 
sell6795, who has the largest 553_4(QT)=0.00056 value 
among the H. 



III. THE PROCEDURE TO DISTINGUISH SD 
FROM PATIENTS 

This Section aims at distinguishing SD from patients, 
where the latter terminology refers to individuals suffer- 
ing only from heart diseases. The QT-Database of phys- 
iobank we use here, includes the following four groups 
of patients (a fifth group that consists of 4 individuals 
only, was disregarded for the reasons discussed in Ref.[8]): 
15 individuals from MIT-BIH Arrhythmia Database (la- 
beled hereafter MIT), 13 from MIT-BIH Supraventricu- 
lar Arrhythmia Database (MSV), 33 from the European 
ST-T Database (EST) and 6 from MIT-BIH_ST change 
Database (MST).jrhe values of A, p, u,SS3-i{QT), 
Xshuf, pshuf and 5S3-4.shuf{Q,T) of all these patients 
are given in Ref. [16]. 

An inspection of the measures A, p, v shows three facts: 
First, all SD and all patients violate one or more H- 
limits. Second, none of the measures A, p, v alone, or a 
combination of two of them, can effectively differentiate 
the SD from the patients. Third, if we consider the three 
measures A, p, v (i.e., 16 parameters) altogether, we find 
that twenty SD out of 24 violate some of the limits of both 
patic;nts and H, thus allowing in principle a distinction 
of the vast majority of SD from the other individuals. 
Thus, in summary, the consideration of the quantities 
(A, /3, v) only, docs not lead to a distinction between all 
SD and patients. The same conclusion is drawn if we 
alternatively consider the quantities (A, Xshuf ,p) only. 

We now turn to the investigation of the 55(QT) values. 
In Fig. 5, the average (55) (QT) value for each group is 
plotted versus the time-window length. It is intriguing 
that the results of the four groups (MIT, MSV, MST, 
EST) of patients are located between H (the lowermost 
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curve) and SD (the uppermost curve). We emphasize, 
however, that if we plot the curves for each one of the 
101 individuals (in a way similar to that of Fig. 2(a)), 
we find that there are some patients the results of which 
overlap with either SD or H. Let us consider only the 
limiting cases -i.e., the lowermost and the uppermost 
curve, to be called hereafter 6S{Q,T)min and SS{QT)max, 
respectively- obtained in each groups of patients. In or- 
der to distinguish SD from patients, we must appropri- 
ately discriminate the overlap which refers to those of the 
patients that lie above the uppermost ^5(QT) curve of H; 
the latter curve from now on will be called SS{QT)rnax.H- 
Thus, the limits of the patients we are currently inter- 
ested in, do not extend from SS{Q,T)min to SS{QT)max, 
since they must exceed SS{Q,T)max,H, i.e., 

5S{QT) > 6S{QT)max,H. (1) 

The curve which corresponds to the one of the patients, 
that has SS{Q,T) lying just above the SS{QT)max,H cor- 
responds to a value, which will be labeled hereafter 
SS{QT)min'- Thus, if we apply the condition 

<55(QT) w < SS{QT) < SS{QT) 

max (2) 

to each group of patients, we are left only with those of 
the patients that actually overlap with SD. 

We now recall that, as mentioned above, the measures 
A, p, V altogether, which are in fact ratios of 5S values, 
enable the discrimination of the vast majority of SD from 
all the others (i.e., patients and H), while the 5S'(QT) val- 
ues themselves efficiently distinguish [8] all SD from H. 
This motivates us to investigate whether a proper com- 
bination of these two facts can serve our purpose, which 
refers to the identification of all SD among the other in- 
dividuals (patients and H). Thus, we now compare the 
quantities A, p, (5S'(QT) altogether, of each SD, to the 
corresponding parameters of only those among the pa- 
tients that happen to have ^5(QT) values exceeding the 
corresponding values of H, i.e., obey the condition (1), or 
preferably the more accurate condition (2). Such a com- 
parison reveals that some of the 17 parameters of A, p, 
(55(QT), in all SD, lie outside the limits of these patients 
(cf. the same happens, of course, if we compare each SD 
to the limits of H). These results point to the conclusion 
that all 24 SD are distinguished from the patients (and 
H). The same conclusion is drawn if we consider instead, 
the 17 parameters A, Xghuf, P, 6S{Q,T). We emphasize, 
however, that the study of the estimation errors (see the 
Appendix) reveals that the confidence level for the dis- 
tinction of all SD from the patients becomes appreciably 
larger if we combine all the measures A, Xshuf, P Pshuf, 
V (of all intervals) with the condition (2) applied to both 
(55(QT) and ^SshujiS^^ (i-e., in reality, we then con- 
sider the limits of those patients whom both (5S'(QT)- and 
(5S'sft,„/(QT)-values are larger than those in H, as shown 
in Fig. 6 of Ref.[8]). 

We finally comment on three points. First, once the 
identification of SD has been completed, the distinction 



between patients and H can be made by identifying as pa- 
tients the individuals whom one or more of the aforemen- 
tioned parameters violate the iJ-limits. Second, since it 
is known that heart rate variability depends strongly on 
the age, it is highly recommended that when compar- 
ing values of the aforementioned complexity measures, 
the corresponding limits should be taken from subjects 
(patients, H) of comparable age. Third, we now focus 
on the importance of the sequential order of on the 
aforementioned complexity measures. We prefer to deal 
with the results related to the RR-intervals since it is 
known that the healthy heart beats irregularly and that 
the intervals between beats (i.e., the RR-intervals) fluc- 
tuate widely, following complicated patterns [29]. Let us 
investigate, for example, the possibility of using Ak(RR) 
alone to distinguish the SD as well as the four groups 
of patients from H, i.e., examine whether the Ak(RR)- 
values of each individual violate one (at least) of the rel- 
evant J?-limits. The results show (see Table II) that the 
vast majority of SD and of each group of patients is well 
distinguished from H by means of Ak(RR). The situa- 
tion drastically changes, however, if we use, instead of 
Ak(RR), the A^.s/iuj-values (see the Tables V to VII in 
[16]): only the minority of SD and of each group of pa- 
tients can be differentiated from H. Since the calculation 
of the Ak (RR)-values takes into account the sequential 
order of Qm, while the AK,s/i„/(RR)-values do not, this 
points to the following conclusion: It is the sequential 
order of beats that inherently contains the primary in- 
formation which enables the distinction between the SD 
and patients, on the one hand, and the H, on the other. 
This might explain why procedures based on the entropy 
in natural time (which is dynamic entropy, affected by 
the sequential order [7, 8], see Section I) -and hence they 
consider the complexity measures mentioned in the pre- 
ceding Sections- can achieve such a distinction, while a 
static entropy (e.g.. Shannon entropy, see Ref. [8]) can- 
not. 



IV. CONCLUSIONS 

First, in SD, the 55- values depend on the length scale 
in a way significantly different from that in H. Hence 
these two groups of humans can be well distinguished. 
Second, the SD, who exhibit critical dynamics, have A- 
values (being, in fact, ratios of 55-values, as mentioned 
above) which approach those of the Markovian case. This 
should not be misinterpreted as showing that the corre- 
sponding time-series are of Markovian nature, because 
the (55-values themselves are approximately one order 
of magnitude smaller than those of the (dichotomous) 
Markovian time-series (sec Fig. 4(a) and Ref. [8]). 
Third, the quantities A, Xshuf i Pi Pshuf, i', (5iS'(QT) and 
5Sshuf{Q^^) altogether, seem to enable the classification 
of individuals into the three categories: H, patients and 
SD. 



7 



APPENDIX: THE INFLUENCE OF THE 
ESTIMATION ERRORS ON THE PROCEDURES 
FOR THE DISTINCTION OF SD 

Beyond the error introduced by the use of an auto- 
matic threshold detector for the alloc;ation of the corre- 
sponding intervals (cf. this is largest for the QT- and 
smallest for the RR-intervals), the following two sources 
of errors must be considered [7, 8]: First, an estimation er- 
ror emerges when analyzing -instead of the original time 
series of length I w 10^- smaller lengths I', which, how- 
ever, still signific;antly exceed the time-window lengths 
used, for example T « 2 x 10^ (the errors associated with 
the measures in the short range, s, are smaller from those 
corresponding to the longer range, L, because for the lat- 
ter range the I /I' values -due to the restricted length of 
the records available- are small, thus not allowing more 
reliable statistics). Second, a source of (statistical) er- 
ror in the results emerges when considering the ratio(s) 
SSshuf /SS (i.e., when dealing with ;y and Xshuf) instead 
of 5S itself. While 6S may be considered to have a unique 
value for a (given) original Qm time-series, the value of 
SSsh u f depends on the randomly shuffled Q„i series each 
time selected (cf. such differences are well known[30] 
when dealing with randomized series of finite length). 
This is why the z^- values given in Ref. [8] for SD and H 
do not fully coincide with those tabulated in the present 
paper. To account roughly for the extent of this statisti- 
cal error, we averaged here the SSshuf-^sl^^s calculated 
over a number (e.g. 20) of randomly shuffled Q^-series 
generated from the same original scries and then the cor- 
responding standard deviation was estimated. 

The final results on the above sources, could be sum- 
marized as follows: The (percentage) estimation error 
was found to be around 10% (cf. this is an average value) 
for the complexity measures X, Xshuf , P, Pshuf, ^ asso- 
ciated with the RR- and QRS-intervals. Furthermore, 
since the error in the (55(QT) may reach 20%, the esti- 
mation error in those of the complexity measures that 
involve ^^(QT) may be as high as « 30%. Upon consid- 
ering such error- levels, hereafter called "plausible estima- 
tion errors" fp, a study of each of the methods for the 
distinction of SD was made. The study was repeated by 
assuming larger (percentage) estimation errors, hereafter 
labeled "modified estimation errors" e™, calculated from 

, Hfnax Hrnin \ / \ i\ 

TTf — (^-^^ 

for each parameter (see Table VIII in Ref. [16]). Both 
studies led, more or less, to the same results. The cal- 
culation, in each study, was made as follows: Each pa- 
rameter was assumed to be equal to its value (initially 
estimated from the original time-series available) multi- 
plied by a number randomly selected in the range 1 ± 
or 1 ± Cm, respectively) and then each of the methods for 
the distinction of SD was applied. This application was 
repeated, for each method, lO'' times via Monte Carlo 
and relevant conclusions have been drawn for both stud- 



ies. The extent to which these conclusions hold, was also 
investigated in the following extreme case: the limits of 
the parameters of H (and patients) , which arc automat- 
ically adjusted for each "random" selection of the val- 
ues described above, have been assumed to additionally 
relax by (extra) amounts equal to Cp or e„i. (Such a 
"relaxation" faces the extreme possibility that the popu- 
lations of H and patients treated here are not considered 
large enough to allow a precise determination of their 
limits, and hence future increased populations' studies 
could somehow broaden these limits by extra amounts as 
large as ep or £„). 

The following conclusions were finally drawn concern- 
ing the distinction between SD and H (see also Table 
III): Among the four methods suggested (i.e., two in 
Ref. [8] and two in Section II), the one that uses the mea- 
sures A, p (associated, however, with all three types of 
intervals, i.e., 10 parameters in total) seems to be robust 
in the following sense: when assuming the error-levels 
mentioned above, the use of A, p still allow with a confi- 
dence level above 99% the distinction of all SD from H. 
( Then a calculation similar to that given in Section II 
concerning the probability that all 24 subjects are clas- 
sified, by means of 10 parameters, as SD by pure chance 
-based on the limits set by 10 other subjects- results in 
[l-(l-2/ll)i°]24w0.03,i.e.,too small. ) The confidence 
level decreases to 63%, 49%, 32% and 59% . respectively, 
when using four parameters or one parameter only as 
follows: First: A„(RR) and ^^(QRS); second: Ak(RR) 
and XK.,shuf(RR-)', third: z/„(RR) and z^„(QRS); fourth: 
SS 3-4{QT). If we investigate the aforementioned extreme 
case of the additional "relaxation" of the H-limits, the 
capability for the distinction of all SD still remains with 
the following results: In the case of using A, p (of all 
intervals), the confidence level in distinguishing all SD 
is 88%, while it becomes appreciably higher, i.e., larger 
than 99%, if we use the quantities A, p, X^fi^f, Pshufi 
v, SS3-4{QT), 5S^-4,^shuf{QT) altogether. When using, 
however, four parameters only in the first three combi- 
nations mentioned above, the confidence level decreases 
to 90%^ 36% and 8% , respectively (and to 77% when 
using (553-4 (QT)), even when allowing two at the most 
SD -out of 24- to be misinterpreted as being H. As for 
the corresponding conclusions related to the distinction 
of SD from the patients, these can be drawn on the basis 
of the values given in the lower part of Table III. 

In summary, the study of the estimation errors reveals 
that {if the limits of the parameters will nothe broadened 
by future investigations) we can satisfactorily distinguish 
the totality of SD from H as well as discriminate the total- 
ity of SD from patients, upon employing the quantities 
A, Xshuf, P, Pshuf, iy,6S3-4{Q,T), (55'3_4,s/i„/(QT) alto- 
gether, i.e., the sixth and the last method, respectively 
in Table III. These quantities also allow the distinction 
of the totality of SD from H (as well as distinguishes the 
vast majority of SD from the patients), even if their 
limits will be eventually broadened (by £„). 

The following remark should be added concerning the 
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number of parameters required to achieve the desired 
distinction: In reaUty, only twelve independent quanti- 
ties, (i.e., the six: SSk{t) and the six SSK,shuf{T), where 
K=s,L and t=RR,QRS, QT) are extracted from the ex- 
perimental data. Thus, for example, beyond 6S3-4{QT) 
or 5Sz-A,shuf{QT)^ eleven additional parameters (out of 
26) of the ratios: A, Xshuf, P, Pshuf, are in princi- 
ple required to be used for the distinction. These twelve 
quantities , however, should not be fortuitously selected, 
but the following points must be carefully considered: 
(i) priority should be given to the eight parameters as- 
sociated with A-values and Xshuf- (or v-) values of RR 
and QRS, (ii) using, at least, one p-parameter (involv- 



ing (553-4 (QT) or SS3-4^shujiQT)), and (iii) examining 
whether the totality of the parameters used can actu- 
ally reproduce the aforementioned twelve JS'-values de- 
termined directly from the data. However, in order to 
avoid the difficulty arising from the completeness (or not) 
of the aforementioned selection, at the present stage (i.e., 
until an appreciably larger number of H and patients 
will be analyzed to allow a better precision in the de- 
termination of the corresponding limits), the preceding 
paragraph recommends to use -instead of twelve- all the 
28 parameters associated with the quantities A, Xshuf, P, 
Pshuf, f, 5Si-i{QT) and SSs-4,shufiQT). 
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TABLE I: The variability measures (A), the relative ones (p), and the ratios v = SSshuf /SS in the short (s) range and in the longer (L) range in H (sell6265 to 
sell7453) and SD (sel30 to sell7152) along with their M3-4(Qr)-values. 
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1.16 


0.50 


1.27 


0.50 


1.08 


0.71° 


0.86° 


1.15' 


0.96 


1.21'' 


0.71° 


1.11'' 


3.96'' 


sel50 


1.32° 


0.59° 


1.28 


0.46° 


1.21 


0.32° 


1.78 


2.31 


1.21 


2.26 


1.07° 


1.00 


0.91 


0.93'' 


1.20'' 


1.62'' 


5.21'' 


sel51 


1.83 


0.72° 


1.14° 


0.42° 


1.24 


0.66 


0.16° 


0.27° 


0.30° 


0.33° 


1.30 


1.04 


1.00 


1.05'' 


1.24'' 


0.90 


1.83'' 


sel52 


1.40° 


0.73 


1.32' 


1.02'' 


1.29 


1.01 


0.14° 


0.10° 


0.42° 


0.31° 


1.51 


1.13'' 


1.17 


1.02'' 


0.73° 


0.67 


1.66'' 


sell7152 


1.06° 


0.93° 


1.31' 


0.58 


1.13° 


0.54 


0.06° 


0.10° 


0.23° 


0.40° 


1.68 


1.01 


1.03 


0.91' 


1.01 


0.97 


1.15'' 


mm 


0.81 


0.11 


0.96 


0.42 


1.02 


0.32 


0.06 


0.10 


0.11 


0.25 


0.84 


0.92 


0.90 


0.41 


0.68 


0.63 


0.69 


max 


1.87 


3.04 


1.52 


1.22 


1.37 


1.01 


2.35 


3.40 


2.41 


4.12 


2.79 


1.24 


1.42 


2.93 


1.46 


1.62 


6.45 



"These values are smaller than the Hmin given in each column 
''These values are larger than the Hmax given in each column 
"^These values do not fully coincide with those given in Ref.[8] for 
the reasons discussed in the Appendix 
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TABLE 11: The number of SD and patients that can be dis- 
tinguished from H when using Ak(RR) or \^.shuf(RR-) alone. 



group 


Total number 


(RR) 


A„,.h„/(RR) A, 


(RR) and A„.afe„/(RR) 


SD 


24 


23 


10 


24 


MIT 


15 


14 


6 


14 


MSV 


13 


13 


2 


13 


EST 


33 


29 


8 


29 


MST 


6 


5 





5 




FIG. 6: (color) The (5S'3-4(QT) values along with those of 
As(RR)- and Al(RR)- for SD (red) and H (black). The in- 
dividual sel33, who is discussed as an example in the text, is 
marked with a green column. 



TABLE III: The confidence levels to distinguish SD from either H or patients when considering the estimation errors em discussed in the Appendix and given in Table 
VIII of Ref.[16] 



Method Employed Confidence levels to distinguish SD 



Using the limits Using 
from the data analyzed broader limits^ 



Aim 


Measures 


Type of 


No. of 


All SD 


All but 


All but 


All SD 


All but 


All but 


All but 






intervals 


para- 




one SD 


two SD"* 




one SD 


two SD 


five SD'' 








meters 


% 


% 


% 


% 


% 


% 


% 


Distinction 


A, p 


RR. QRS, QT 


10 


>99 


>99 


>99 


88 


99 


>99 


>99 


of SD 


A, p 


RR, QRS 


4 


63 


95 


>99 


8 


43 


90 


>99 


from H 


^shuf 


RR 


4 


49 


90 


99 


1 


11 


36 


97 




1/ 


RR,QRS 


4 


32 


74 


96 


<0.5 


1 


8 


60 




SS3-4(QT) 


QT 


1 


59 


93 


>99 


11 


39 


77 


>99 




A, p.Asft, Psh, 1/, 


RR, QRS, QT 


28 


>99 


>99 


>99 


>99 


> 99 


>99 


>99 




SS3-4{QT),SSsh,3-4iQT) 




















Distinction 


A, p, ly, SS3-4{QTy 


RR, QRS, QT 


17 


51 


83 


95 


<0.1 


<0.1 


<0.1 


1 


of" SD 


A, p, 5S3~4{QTY 


RR, QRS, QT 


17 


62 


91 


98 


<0.1 


<0.1 


<0.1 


1 


from patients 


A, p, Xsh, Psh, V, 


RR, QRS, QT 


28 


95 


>99 


>99 


16 


41 


68 


98 




SS3-dQT), 






















SSsh.s-iiQT)" 





















''Considering the limits of those patients that have 5S3-a{QT) 
larger than those in H 

''Considering the limits of those patients that have both 
553-4 (QT) and 5Ss}i^3—4,{QT) larger than those in H 

"^by amounts tm given in Table VIII of Ref. [16] 

''When stating, e.g., "All but one", it means that when allowing 
at the most, one SD -out of 24- to be misinterpreted as being H or 
patient, respectively 



